SUMS OF HERMITIAN SQUARES 
AND THE BMV CONJECTURE 

IGOR KLEP AND MARKUS SCHWEIGHOFER 



Abstract. We show that all the coefficients of the polynomial 

tr({A + tB)'") e K[t] 

are nonnegative whenever m < 13 is a nonnegative integer and A and B are 
positive semidefinite matrices of the same size. This has previously been known 
only for m < 7. The validity of the statement for arbitrary m has recently been 
shown to be equivalent to the Bessis-Moussa-Villani conjecture from theoretical 
physics. In our proof, we establish a connection to sums of hcrmitian squares of 
polynomials in noncommuting variables and to semidefinite programming. As 
a by-product we obtain an example of a real polynomial in two noncommuting 
variables having nonnegative trace on all symmetric matrices of the same size, 
yet not being a sum of hermitian squares and commutators. 



1. Introduction 

While attempting to simplify the calculation of partition functions in quantum 
statistical mechanics, Bessis, Moussa and Villani (BMV) conjectured in 1975 [BMV] 
that for any hermitian n x n matrices A and B with B positive semidefinite, the 
function 

is the Laplace transform of a positive measure /i^'^ on M>o- That is, 

Jq 

for all t € M. By Bernstein's theorem, this is equivalent to (p^^^ being completely 
monotone, i.e., 

for all s e No and t e M>o- 

Due to its importance (cf. [BMVl [LiSe] ) there is an extensive literature on this 
conjecture. Nevertheless it has resisted all attempts at proving it. For an overview 
of all the approaches before 1998 leading to partial results, we refer the reader to 
Moussa's survey |Mou| . 



Date: July 31, 2008. 

2000 Mathematics Subject Classification. Primary 11E25, 13J30, 15A90; Secondary 15A45, 
08B20, 90C22. 

Key words and phrases. Bessis-Moussa-Villani (BMV) conjecture, sum of hermitian squares, 
trace inequality, semidefinite programming. 

The first author acknowledges the financial support from the state budget by the Slovenian 
Research Agency (project No. Zl-9570-0101-06). 

Supported by the DFG grant "Barrieren". 

1 



2 



IGOR KLEP AND MARKUS SCHWEIGHOFER 



In 2004, Lieb and Seiringer |LiSe| achieved a breakthrough paving the way to a 
series of new attempts at proving the BMV conjecture. They succeeded in restating 
the conjecture in the following purely algebraic form: 

Conjecture 1.1 (BMV, algebraic form). The polynomial 

p := tr((A + iB)") e R[t] 

has only nonnegative coefficients whenever A and B are n x n positive semidefinite 
matrices. 

The coefficient of t'^ in p is the trace of Sm,k {A, B), the sum of all words of length 
TO in ^ and B in which B appears exactly k times (and therefore A exactly m ~ k 
times). It is easy to see that these coefficients are real for hermitian A, B. 

Suppose A, B are positive semidefinite n x n matrices. For A:<2orTO — fc<2, 
each word appearing in Sm,k{A,B) has nonnegative trace as is easily seen. This 
proves the conjecture for m < 5. For n < 2, A can (as always) be assumed to be 
diagonal and after a diagonal change of basis also B has only nonnegative entries. 
Hence the conjecture is trivial for n < 2. The first nontrivial case (to, k, n) = (6, 3, 3) 
was verified by Hillar and Johnson [HJ] with the help of a computer algebra system 
by considering entries of both 3x3 matrices, A and B, as scalar and therefore 
commuting variables. Hagele [Hag] shifted the focus from scalars to symbolic com- 
putation with matrices (regardless of their size) and gave a surprisingly simple 
argument settling the case (to, k) = (7, 3) and thus also (to, k) = (7, 4) by symme- 
try. Combined with the easy observations from above, this proves Conjecture 11.11 
for to = 7. 

Hagele then deduced the case m — 6, which he could not solve directly with 
his technique, by appealing to the following seminal result due to Hillar |Hil| : If 
Conjecture 11.11 is true for to, then it is also true for all to' < to [Hil) Corollary 
1.8]. A strengthening [Hill Theorem 1.7] of this result (see Section [4] for a precise 
statement) is crucial for our main contribution: 

Theorem 1.2. The BMV Conjecture\rB holds for to < 13. 

We exploit semidefinite programming to find certain certificates for nonnegativity 
of ti{Sm,kiA, B)) which are dimensionless (i.e., valid for all n). These certificates 
are algebraic identities in the ring of polynomials in two noncommuting variables 
involving sums of hermitian squares. The found identities are exact though obtained 
with the help of numerical computations. But they exist only for certain pairs (to, k) 
and we have to rely on Hillar's work to deduce Theorem 11.21 For instance, such a 
sum of hermitian squares certificate does not exist for (to, k) ~ (6, 3), see Example 
[531 

With the benefit of hindsight, Hagele's argument can be read as such a certificate 
for the case (to, fc) = (7,3). However, the certificates we give for (to, fc) = (14,4) 
and (to, k) = (14, 6) are much more involved and seem to be impossible to find by 
hand. 

This paper is organized as follows. Section [2] develops the appropriate algebraic 
framework needed for the desired nonnegativity certificates. In Section [3] the ex- 
istence of such a certificate is transformed into a linear matrix inequality (LMI) 
enabling us to search for these certificates using semidefinite programming (SDP). 
Section [4] explains the overall argument for the proof of Theorem 11.21 The proof 
itself is presented in full detail in Section El A synopsis of our results and other 
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recent developments is given in Section [6l where we also relate the BMV conjec- 
ture to another just as old open problem of Connes on IIi factors. Finally, in the 
appendix we streamline the proof of the mentioned crucial result of Hillar and give 
an alternative argument to prove the BMV conjecture for m = 13 avoiding Hillar's 
theorem. 

2. From matrices to symbols 

The gist of our method is to model the matrices as noncommuting variables 
instead of disaggregating them into scalar entries modeled by commuting variables. 
To this end we introduce the ring of polynomials in two noncommuting variables. 

Remark 2.1. It is easy to see }KS2| Lemma 3.15] that the nonncgativity of 
tr(S'm_fc(A, _B)) for all positive semidcfinitc complex A and B of all sizes need only 
be checked for all positive semidefinite (in particular symmetric) real A and B of 
all sizes (by identifying n x n complex matrices with 2n x 2n real matrices). We 
therefore work over the real numbers. 

We write {X, Y) for the monoid freely generated by X and Y, i.e., {X, Y) consists 
of words in two letters (including the empty word denoted by 1). Let K(X, Y) denote 
the associative K-algebra freely generated by X and Y. The elements of K(X, F) 
are polynomials in the noncommuting variables X and Y with coefficients in R. 
An element of the form aw where ^ a G M and w G (X, Y) is called a monomial 
and a its coefficient. Hence words are monomials whose coefficient is 1. We endow 
M.{X,Y) with the involution p ^ p* fixing R U {X, F} pointwise. Recall that an 
involution has the properties {p + q)* = p* + q* , {pq)* = q*p* and p** = p for all 
p,q & R{X,Y). In particular, for each word w £ {X,Y), w* is its reverse. 

Definition 2.2. Two polynomials /, g e ]R(X, F) arc called cyclically equivalent 
if 5) if / ~ 5 is a sum of commutators in M(X, Y). Here elements of the form 
pq — qp are called commutators [p, q G R{X, Y)). 

This definition reflects the fact that ti-{AB) = tr{BA) for square matrices A and 
B of the same size. The following proposition shows that cyclic equivalence can 
easily be checked and will be used tacitly in the sequel. Part (c) is a special case of 
[KS2( Theorem 2.1] motivating the definition of cyclic equivalence. 

Proposition 2.3. (a) For w, w G {X,Y), we have v w if and only if there are 
^1,^2 G {X,Y) such that v ~ V1V2 and w = V2V1. 

(b) Two polynomials f = J2we{x,Y) ^^w and g = J2we{x.Y} ^^w {an,,bn, G M) are 
cyclically equivalent if and only if for each v G {X, Y) , 

aw = ^ hw 

(c) Suppose f G m.{X,Y) and f* = /. Then f if and only iftr{f{A,B)) = 
for all real symmetric matrices A and B of the same size. 

Definition 2.4. For each subset S C M.{X, Y), we introduce the set 

SymS" -.^ {g e S \ g* ^ g} 
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of its symmetric elements. Elements of the form g*g [g e R(X, y)) are called 
hermitian squares. We denote by 

:= {^5*5. I .9. e nX,Y)} C Sym]R(X,r) 
the convex cone of all sums of hermitian squares and by 

the convex cone of all polynomials that are cyclically equivalent to a sum of her- 
mitian squares. 

The following theorem proved in [Hel| also holds for several variables and moti- 
vates the use of sums of hermitian squares (see |HP| for a survey of recent develop- 
ments). We will only use the easy implication from to 

Theorem 2.5 (Helton). The following are equivalent for f E SymM.{X,Y) : 

(i) / e s^- 

(ii) f{A, B) is positive semidefinite for all n G N and A, B E SymM"^". 

To obtain the desired type of certificates we try to merge Proposition l2.3[( cj) with 
Theorem 12.51 However, such certificates do not always exist. 

Remark 2.6. Consider the following conditions for / e M(X, F): 
(i) / G e^; 

(ii) tr(/(A, B)) > for all neN and A,B e SymM"^". 

Then (P implies jHI but not vice versa. For instance, 

YX*Y + XY^X - 3XY^X + 1 e Sym]R(X, Y) 

satisfies ^ but not jU (see |KS2[ Example 4.4] for details). Later on we will see 
further such examples. 

3. From symbols to matrices 

To search systematically for the certificates just introduced, we develop a non- 
commutative version of the Gram matrix method. The corresponding theory for 
polynomials in commuting variables is well-known and has been studied and used 
extensively, see e.g. |CLR[ IPS] . 

Checking whether a polynomial in noncommuting variables is an clement of 
or O^, respectively, is most efficiently done via the so-called Gram matrix method. 
Given a symmetric / G M.{X, Y) of degree < 2c? and a vector v containing all words 
in X, Y of degree < d, there is a real symmetric matrix G with / = v*Gv. (Here 
V* arises from v by applying the involution entrywise to the transposed vector w*.) 
Every such matrix G is called a Gram matrix for /. Obviously, the set of all Gram 
matrices for / is an affine subspace. 

Example 3.1. Consider the polynomial 

h := X^ + 2XYX + 2X^ + Y^ + 2Y +1 e SymM(X,y). 
Since h has degree four, we choose 

V := [1,X, Y, X^, XY, YX, F^]*. 
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Then every Gram matrix for h has the form 



1 





1 


a 








b 





2 - 2a 











1 





1 





1 - 2b 














a 








1 



































1 

















b 





















e SymR^""^. 



We will revisit this example below. 



From Cholcsky's decomposition wc deduce that / £ SymM.{X,Y) is a sum of 
hermitian squares if and only if it has a positive semidcfinitc Gram matrix. Indeed, 
if G = C*C is a positive semidefinite Gram matrix for /, then / = v*C*Cv = 
{Cv)*{Cv) = J2t9i9t € S2 where g, e M(X,y) is the i-th entry of the vector Cv. 
The converse foUows the same hne of reasoning. 



Example 13.11 continued. There is no positive semidefinite Gram matrix G for h 
since the determinant of the submatrix 



G22 G26 
Ge2 Gee 



2 - 2a 1 
1 



is always negative. Hence h^Y?. 



The existence of a sum of hermitian squares decomposition of / € SymM(X, Y) 
is equivalent to an LMI feasibility problem. As such it can be decided by solving 
the SDP 



minimize tr(G) subject to v*Gv = f, G positive semidefinite. 



Note that v*Gv = / are just linear constraints on the entries of G as one sees by 
comparing coefficients. The objective function G 1-^ tr(G) is often a good choice for 
finding nice low rank matrices G but can be replaced by any other function linear 
in the entries of G. If the polynomial is dense (no sparsity), the dimension of the 
LMI is equal to {2'^+^ - 1) x (2'^+^ - 1). For more on SDP, we refer the reader to 
the survey |Tod| . 

Likewise, checking whether / e 8^ can be done by solving the SDP 



minimize tr(G) subject to v*Gv ~ f, G positive semidefinite. 



By Proposition [2T3l|b|, v*Gv f are again linear constraints on the entries of G. 

For the sake of convenience, from now on a real symmetric matrix G will be 
called a Gram matrix for / g R{X, Y) (with respect to a vector of words v) if 
/ ~'w*G?;. 
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Example 13.11 continued. Every Gram matrix (in tlie new sense) for h has the 
form 



1 


1 




1- ifli 


-02 - 03 


02 


1 i« 

2 2"4 












— — 07 4 


1 ag 


-ag - ag 


1 


03 04 




07 


ag 


ag 





1 - ifll 


aj 




1 


"flio 


aio 


— ^aii — iai2 


— 02 — 03 - 


-Og — 07 + 1 Og 




-aio 


an 





-as 




0-6 ag 




aio 





ai2 


as 


1 In 
.2 5 4 


-as - flg 




— ^aii — ^ai2 


-05 


05 





Setting 04 = 


07 = 1 and all other a.; to zero, 


we get the positive semidefinite 


matrix G = 


[10110 





0] [1 1 


1 


0] with 


corresponding 


representation h {X'^ + Y + 




)2 e I]2, i.e., h G 92. 







In the proof of our main result we will use the Gram matrix method to show 
that certain Sm.ki^'^ 7^"^) ^ 0^- We start by dramatically reducing the sizes of 
corresponding SDPs with a monomial reduction. For this, we need a technical 
lemma. 

Lemma 3.2. Let pi G R{X,Y). 

(a) // for A,B€ SymK"><", tr iJ2^iP^P^)iA, B)) = 0, then pi{A, S) = for all i. 

(b) If J2iPiPi 0; ^'ien = for all i. 

Proof, (a) Denote by ej the canonical basis vectors of M". Then 
= ti-C£{p*p,){A,B)) = ^((p*P.)(AS)e„e,) ^(p,(A, i?)e„p,(A, S)e,). 

Hence Pi{A, B)ej = for all i,j and thus Pi{A, £?) = for all i. 
(b) If 0> then tT{J2,PriA, B)*p,{A, B)) = 0, and by the above, B) = 

for all symmetric A and B of all sizes n. This implies Pi = for all i (see e.g. jKSli 
Proposition 2.3]). □ 

Not only do we drastically reduce the number of words needed in the Gram 
method for Sm,k{X'^ , Y^) but we also impose a block structure on the Gram matrix 
G with blocks Gi. This is done in the following proposition. We use self-explanatory 
notation like {X^, Y^Y for the set of all words that arc concatenations of £ copies 
of ^2 and Y^. 

Proposition 3.3. Fix m, fc G N. 

(a) // m and k are even, set 

Vi := {v G {X^, Y^}^ I deg^ v = m- k, degy v = k} , 

V2 ;= {v G X{X^, y^l^-^X I dcgx v = m~k, degy v ^ k} , 

V3 {v G Y{X^, Y^j'^-^Y \ dcg^ v = m-k, degy v = k} . 

(b) // m is odd and k is even, set 

Vi := |v G I deg^ v = m-k, degy w = fcj , 

V2 := |i' G {X^, y2}^x I dcg^ v = m~k, degy w = fcj . 
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(c) // m and k are odd, set 

Vi := |u e Y{X^, I degx v^m-k, degy w = A:} , 

V2 := |w e {^2, y2 I deg^ -y = m - fc, degy w = fcj . 

(d) // m is even and k is odd, set 

Vi := {v e X{X^, Y^j'^'^Y I dcg^ v = m-k, degy v = k} , 

V2 := {v e y{X^ | deg^ v = m~k, degy v = k} . 

Let Vi denote the vector [w]„gy. . Then Sm,k{X'^, Y'^) £ if and only if there exist 
positive semidefinite matrices Gi S SymR^'^^* such that 

(1) S„,,u{X\Y^)'L^Y.^lG^v^. 

i 

If Gi = C*Ci and Ci € (J^ some index set), then with [pij]j^j. := CiVt we 

have 

(2) S^.u{X\Y^)'l'Y.PhP^'r 

id 

Proof. The second statement is clear since 

i i i ij 

We assume without loss of generality that 1 < k < m — 1. Suppose that 
SmAX^Y^) e 92, i.e., 

(3) s„,ax',y')'^'J2p>^ 

j 

for finitely many Q ^ pj g R{X,Y). Set d := maxj dcgyPj and let Pj be the sum 
of all monomials of degree d with respect to Y appearing in pj . 

Fix real symmetric matrices A and B of the same size. For any real A, we 
have X'^'' tT{Sm,ki^^ ^ B^)) = tr(EjPj (^, AS)*pj (A, AB)). We consider this as an 
equality of real polynomials in A. 

If we assume d > k, then tr{J2 j Pj {A, B)* Pj {A, B)) = since the degree of 
the right hand side polynomial cannot exceed the degree of the left hand side 
polynomial. By (gj) of Lemma [3.21 we get Pj{A,B) = for all j. Since A and 
B were arbitrary, this implies Pj = by Lemma I3.2[|b|) , contradicting the choice 
of d. Therefore all monomials appearing in pj have degree < k in Y. By similar 
arguments, one shows that all pj are actually homogeneous of degree m — k in X and 
homogeneous of degree k in Y, i.e., pj G span^ W where W is the set of all words 
of length m with the letter X appearing m — k times and the letter Y appearing k 
times. 

Claim. Suppose we are in one of the cases (a)-(d) and Vi G Vi for each i. Then 
v*Vj u for some u G {X^, y2|m •£ only li i = j. 

Proof of claim. The "if" part is immediate. To show the "only if" part, we assume 
that i ^ j and show that v*Vj contains YX^Y or XY^X as a subword for some 
odd i. Then the claim follows by Proposition 12. 3f [5]). 

The existence of such a subword must be checked case by case. As an example, 
consider (a). By symmetry arguments, it suffices to look at vlv2 and V2V^. In the 
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former case, the letter at position m + 1 in vlv2 is an X which is followed to the 
left and right hand side by finitely many X^. This block of X's has odd length and 
is embraced at both ends by a K since we have assumed fc > 1. In the latter case, 
there is an X at the m-th and a y at the (m + l)-st position in V2V3. This Y is 
followed to the right hand side by finitely many Y^ giving a block of F's of odd 
length surrounded by X^s. 

The other cases (b)-(d) arc essentially the same, proving the claim. 

Write each pj as pj = J2i Pi,j + 1j where pi,j € span^ Vi and qj e spaujj U with 
U := VF \ Uj Vi- By the claim, p*pj = J2iPi,jPh3 + fj where J2iPi,jPi,j is a hnear 
combination of words that are cyclically equivalent to a word in {X^, y^jm g^^^^ 
is in the linear span of words not cyclically equivalent to a word in {X^, Y^}^\ By 
part (|b| of Proposition [2731 it follows that ^ can be split into 



Pi^jpi,] 



and 



Now let J be the index set consisting of all j and define matrices Ci G R-'^^' 
by [pi,j]jej = CiVi. Then the matrices Gi := C*Ci are positive semidefinite and 
satisfy Q. □ 

We illustrate the proposition by two examples. 

Example 3.4. We have Sg^iX"^ ,Y'^) e 6^. For instance, with 



V3 



[Y^X^Y^X^, Y^X^, X^Y'^X^, Y^X'^Y^, X'^Y'^ , X^Y^X^Y^]\ 

[XY^X^, xy'^x'^y'^x, x^y^xW 

\Y^X^Y, YX'^Y'^X'^Y, YX'^Y^]^ 



and 



Gi = 



4 4 

4 4 


3 3 

1 1 
1 1 



1 1 

1 1 

3 3 


4 4 
4 4 



Gj — — 



■ 1 


-1 



-1 

1 



584(X2,y2) I J2i=l^tG^v^■ Thc matriccs G, which we found using SDP are 
positive semidefinite as can be seen from their characteristic polynomials 

PG2 = 

Alternatively, we can use the Cholesky decompositions Gi = G*Gi for 

"4 4 3 1 1 " 
2\/3 2V3 2V3 
_0 V3 -V3 -\/3 



PG, = 2f - G 



Gi 



G2 = G3 = [1 -1] 



A first nontrivial nonnegativity certificate of this type was found in an ad hoc 
fashion by Hagele |Hag| , namely 

S'7,3(X2,y2) '')^7{Y^x^Yy{Y'^X^Y) + 

7(^^2y2^2y _^ x'^Y^y {X'^Y^ X'^Y + X'^Y^) e 
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This proves Conjecture 11.11 for m ^ 7 (since the cases k < 2 and m — k < 2 are 
trivial and Sj ^a^X"^ ,Y'^) = Si ,z{Y'^ , X'^) e 9^). Note that the representation ^ 
uses only words from Vi of Proposition ESftd) • Hagele also showed that there is 
no such representation for Sq,^{X'^ ,Y'^) using only words from Vi of Proposition 
I3.3tp]) . However, he speculated that admitting more words might lead to such a 
representation meaning in our setup that S'g_3(X^,y^) e O'^. Our next example 
proves that this is not the case. 

Example 3.5. We show that Se^^iX'^ ,Y'^) ^ 6^. Suppose, by way of contradiction, 
that S'6,3(X2,y2) g 02^ rpj^g^ Proposition [S31|d|, with the basis 

V = {y3x3, YX'^Y'^X, XY'^X'^Y, X^Y^} 

we can find a positive semidefinite Gram matrix for Sq_j,{X'^ that is block 
diagonal of the form 



ail 


ai2 





" 


0.12 


022 














bii 


bi2 








bi2 


622 . 



Ge^3 — 



ail ai2 

0-12 022 







6,3 



With V = H„6V, it follows from Se,s.{X^,Y^) v*Ge.3.v that 




2 — 022 6 - ai2 
6 - ai2 6 - an 

For a positive semidefinite matrix of this form, < an < 6, < 022 < 2, 

(5) ai2 < aiia22, 

(6) (6-ai2)2 < (6-an)(2-a22). 
By adding ^ and (O, we obtain 

36 — 12ai2 + 2a^2 < 12 — 2aii — 6022 + 2aiia22- 
As — 2aii — 6a22 + 2aiia22 = a22(aii — 6) + aii(a22 — 2) < 0, this implies 

> a?2 - 6ai2 + 12 = (ai2 - 3)^ + 3, 
a contradiction. Hence S^^^iX'^ ^Y"^) ^ 9^. 

4. Strategy of the proof 



An important ingredient in the proof of Theorem [TT2] will be the following descent 
resuh of Hillar [UIl Theorem 1.7]: 



Theorem 4.1 (Hillar). The failure of Conjecture 11.11 for a certain (m,k) implies 
failure for all (to', k') with m' — k' > m — k and k' > k. 

In view of this theorem it suffices to prove Conjecture 11.11 for (to, k) = (14, 4) 
and (to, k) — (14, 6). To do this we apply our Gram matrix method to prove that 
SuAX^Y^) G 92 and Su,eiX^,Y^) G 9^. 

Since the search for positive semidefinite Gram matrices is done by SDP, the 
entries of the found matrices are only fioating point numbers and do not provide 
a sound proof for the existence of a certificate of nonnegativity. However, in our 
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case, there happen to exist such Gram matrices with rational entries and we have 
employed several strategies and heuristics to find them. 

First, we have detected symmetries and patterns in the numerical solutions and 
imposed them as additional constraints in subsequent SDPs. Second, we have 
worked with different objective functions in order to find solutions with some "nice" 
rational entries that could be fixed. Finally, we have employed rounding techniques 
involving heuristics to guess the prime factors appearing in the denominators of the 
presumably rational entries. All too often, we have however lost numerical stability 
and had to backtrack in this manually guided refinement process. 

For a systematic treatment of finding exact rational sum of squares certificates 
for polynomials in commuting variables we refer the reader to |PPj . see also [Hi2j 
and the references therein. 

5. Proof of Theorem [TT^ 

As mentioned above, it suffices to show that S'i4^4(X^, F^), S'i4,6(-'^^, € 6^ 
(cf. the table on page [15] below) . Let 

i;i4 4 =[Y^X^°Y^, X^Y^X^Y^X^, X^Y^X^, X^Y^X^Y^X^, X'^Y^ X'^Y^ X^ , 

^8y4^2 _^ x^Y'^X^Y^X^, X^Y^X^Y^ + X'^Y^X^Y^, 

X^yi + X^Y^X^Y^ + X^Y^X'^Y^Y 

and 



7 

















7 


7 





7 


7 





7 


7 











7 


14 





7 


7 

















7 


7 


7 


7 


7 





7 


7 


7 


14 


14 


7 


7 





7 


7 


7 


14 


14 


7 


7 


7 








7 


7 


7 


14 


14 


7 








7 


7 


7 


14 


14 



Then Su4X^,Y^) ^ v^^^ 
with Cholesky decomposition Gi4^4 = ^14 4^14.4, where 

il4.4 = V7 



G14.4W14.4. The matrix Gi4_4 is positive semidefinite 



1 

1 1 

1 






1 1 

111 



1 1 





1 1 



We now consider 5*14 g(X^, Y'^). Let A14 g be the symmetric 15 x 15 matrix from 
page [TH and 

iti4^e =[Y^X^Y'^X^Y, YX'^Y^X^Y^X% Y^X^Y^X^Y, YX^Y^X^Y, 

Y^X^Y^X^ Y^X^Y, YX^Y^X^Y, YX^Y^X^Y^X^Y, Y^X^Y^, 

YX^Y^, YX^Y^X^Y^, YX^Y^X^Y, YX^Y^X^Y^, 

YX^Y^X^Y^X^Y, YX^Y^X^Y^Y- 

From the matrices on pages fT3l and [141 we form a symmetric 35 x 35 matrix Bi^^ as 
follows: The top left 18 x 19 block is given by the matrix on page [131 the bottom 
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left 17 X 19 block is given on page [Ml and the other entries are obtained from 

[Bi4:fi]i,j = [Bi4,fi]3e-j,36-i for ij > 19. 

Let 

wi4,6 =[Y^X^Y^X^Y^, Y*X^Y^, Y^X'^Y^X^, Y^X^Y^X^Y^X^ , X^Y^X^Y^X^, 
Y^X^Y^X^Y^X^, Y^X^Y^X^, X^Y^X^Y'^X^, Y^X'^Y'^X^, 
X^Y^X^Y^X^, Y^X^Y^X^Y^X^, Y^X^Y^X^, X^Y^X^, Y^X^Y'^X^, 
Y'^X^Y^X^, Y^X^, X^Y^X^, X^Y^X^Y^X^Y^X^, Y^X^Y^X^Y^, 
X^Y^, X^Y^X^Y^, X^Y^X^Y^, X^Y^X^, X^Y^X^Y"^, 
X^Y^X^Y^X^Y^, X^Y^X^Y'^X^, X^Y'^X^Y^, X^Y'^X^Y^X^, 
X^Y^X^Y^, X^Y^X^Y^X^Y^, X^Y^X^Y^X^, X^Y^X^Y^X^Y^ , 

Then 

(7) Si4,s{X^, Y^) Um_6^14,6W14,6 + ^'14,6-^14,6 ^14,6 • 

Both matrices ^14.3 and i3i4,6 are positive semidefinite as is easily checked by look- 
ing at the corresponding characteristic polynomials using symbolic computation. 
Hence Si4fi{X^,Y^) e 9^. By Theorem 14711 this proves the BMV conjecture for 
m<13. 

Remark 5.1. The word vectors mi4,6 and iui^fi as well as the matrices on pages 
[T2| [T3I and fT4l can be found in the Mathematica notebook that is available with the 
electronic version of the source of this article: 

http : //arxiv . org/abs/0710 . 1074 
In the same file we also provide code that verifies the nonnegativity certificate ([7]) 
when executed. 
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6. Concluding remarks 

6.1. Current state of the BMV conjecture. The foUowing tabic shows the 
examples we have computed on an ordinary PC running Mathematica with the 
NC Algebra package |HMS| . Yalmip [L63 and the SDP solver SeDuMi [Stu] . Most 
of the computations took a few seconds, some of them a few minutes. 



// 





1 

2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 
19 



+ + + 

- + e + 



f + + 
■ + + - 
f + - 

+ + 4- 



+ 



+ 



+ 



©-©-+++ 
+ - + + + 



+ + 



+ 



. ^ + + + 

f ? ? ? + - + + 

? ? ? ? \ LI 



symbol 


meaning 


+ 


Srn.k is in 0^ for trivial reasons 




Srn.k is in 0^ (with proof) 


+ 


Sm.k is in 0^ (numerical evidence) 


© 


Sm,k is not in 0^ (with proof) 




Srn.k is not in 0^ (numerical evidence) 




Legend 



While finishing our paper, Landweber and Speer sent us a closely related preprint 
|LaSp| where they prove for example that 5*^,4(^^,5^^) G 6^ for odd m and that 
SwfiiX"^ .Y"^) e 0^. Their certificates only use words from V\ of Proposition 13.31 
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They also give results on the negative side, which imply by Proposition 16.11 below 
that Y^) ^ 9^ in the following cases: 

(1) m is odd and 5 < fc < m — 5; 

(2) m > 13 is odd and fc = 3; 

(3) m is even, k is odd and 3 < fc < ?7i — 3; 

(4) (TO,fc) = (9,3). 

The compatibility between our setup and the setup of Landweber and Speer |LaSp| 
is provided by the following proposition communicated to us by Eugene Speer. We 
thank him for letting us include this result. 

Proposition 6.1. Retain the notation from Proposition \3.3\ and assume that m or 
k is odd. Then Sra,k{X'^-, Y'^) G if and only if Sm,k{X'^ , Y'^) v^GiVi for some 
positive semidefinite Gi (or equivalently, if and only if Sm,k{X'^ ,Y'^) V2G2V2 for 
some positive semidefinite G2). 

Proof. One direction is trivial and for the converse suppose that S„i^k{X'^ ,Y'^) £ 
62. Then by Proposition ESI SmA^^^Y^) 'E^=l'"*G^v^ for some positive 
semidefinite Gi, G2. Note that w e Vi if and only if w* E V2. Hence, 

V^GiVi = ^ v*{Gi)yuU= ^ w(G'i)^^Z* ''^ ^ Z*{G[).u]zW = V2G[V2, 

V.U^Vl W,Z^V2 W,Z^V2 

where G'l is a positive semidefinite matrix obtained from Gi by a relabelling of rows 
and columns. Thus 

2 

S^AX^Y') ^^^^^^ ^2 (G'l + G2)v2 

i=l 

and similarly Srn,k Y^) vl (Gi + G^)wi . □ 

Independently of the work of Landweber and Speer, the doctoral student Burgdorf 
[Burj ■ initially guided by further numerical experiments, found a combinatorial 
proof of S'™,4(^^ Y"^) e 92 for all m. 

To summarize, the table on page 1151 can be updated as follows: 
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+++©»©+++ 


9 




+++effiffie+++ 


10 




+++effieffie+++ 


11 
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+ + + 
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19 


+ + + 


effieeeeeeeeeeffie+++ 


20 


+ + + e 


©e?e?e?e?e?e®e + + -f 


21 


+ + + e g 


)eeeeeeeeeeeeeffie++ 


22 


+ + + e © 


©?e?e?e?e?e?e®©+4 
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Moreover, the table continues like one would expect from looking at the lines m = 
19, 20, 21, 22. Hillar's descent Theorem 14. II together with positive results for fc = 4 
(by Landwebcr and Speer and, independently, by Burgdorf) establishes Conjecture 
II .11 for fc < 4 and m — k < A. Also, there is still the possibility of proving the BMV 
conjecture in the same manner by replacing a suitable sequence of ?, which only 
occur for even m and fc, by ©. 

Very recently, using analytical methods, Fleischhack [Fle| and, independently, 
Friedland |Frij have shown the following: For fixed positive semidefinite A, B and 
fc S N there is an to' > fc, such that tr Sm,k{A, B) > for all m > m'. If m' could be 
chosen independently of A, B, then Conjecture [TTT] would follow by Hillar's descent 
theorem. 

6.2. Relation to Connes' embedding conjecture. In |KS2j we studied the 
following conditions for real symmetric polynomials / in noncommuting variables 
X := {Xi, . . . , Xr): 

(i) tr(/(Ai, . . .,Ar)) > for aU n e N and aU Ai e SymR"''" with < 1; 

(ii) T(/(ai, . . . , flr)) > for all Ili-factors T and all e Sym with ||ai|| < 1; 

(iii) VeeIR>o3.geM(X): 

f + e%'ge M := g^g, + ^ h*^ (1 - X^)h,, \ 5,, e R{X)}. 

We proved that ^ and (pS)) are equivalent and imply Moreover, we showed 
that the converse implication Q ^ (jll} is equivalent to an old conjecture of Connes 
about type Ili-factors. 

In Example 13.51 we have seen that Se^3{X^ ,Y^) ^ Q^, hence the tracial version 
of Helton's sum of hermitian squares theorem |Hel| fails (cf. also Remark 12. 6p . By 
homogeneity, even S'6,3(X^,y^) + e ^ for all e S R. Similarly, there is no 
g e M with S'6,3(X2,y2) %^ g although Se,3{X^,Y^) satisfies ©. However, it is 
unknown whether Se,3{X^ ,Y^) satisfies ([n]) (or equivalently, jml)). If it does not, 
then Connes' embedding conjecture fails. 

Acknowledgments. We would like to thank Christopher Hillar for introducing the sec- 
ond author to the BMV conjecture at an IMA workshop in Minneapolis. The main part of 
the work was done at the Universitat Konstanz, the former host institution of the second 
author, during a stay of the first author financed by the DFG. Part of the work was also 
done during the Real Algebraic Geometry workshop in Oberwolfach in March 2007. A 
preliminary report of this work appeared in the Oberwolfach reports [KS3j . We would like 
to thank Peter Landweber and Eugene Speer for the careful reading of a previous version 
of the manuscript. They provided us with a detailed list of comments and corrections that 
improved the exposition as well as some of the results. Also, Pierre Moussa and Sabine 
Burgdorf contributed some valuable remarks. Finally, we would like to thank two anony- 
mous referees for their suggestions which greatly contributed to the overall presentation. 

Appendix A. Euler-Lagrange equations 

Hillar's proof of the descent Theorem 14.11 relies on [Hill Corollary 3.6]. In this 
section we prove a similar statement, Lemma I A. !( which can alternatively be used 
to prove the descent theorem by a simple inspection of Hillar's proof. 
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Our proof of Lemma lA.ll uses only Lagrange multipliers and is shorter and 
simpler than Hillar's variational proof of [Hil) Corollary 3.6]. However, the two 
results are not entirely reconcilable. 

For a variational approach to the original form of the BMV conjecture, we refer 
the reader to [LeCj , see also |Mouj . 

Lemma A.l. Given n G N, suppose that (A, B) minimizes tr[Sm.k{A^ , B"^)) among 
all symmetric A, B Cz K"^" of Hilbert- Schmidt norm 1. Suppose further that A and 
B are positive semidefinite. Then 

(8) AS,n-i.k{A\B^) = !!i^tr(5„,fc(A2,B2))A and 

m 

(9) BS^.i,k-M\B'') = -iv{S^,k{A\B''))B. 

m 

Proof. We actually prove more. We fix an arbitrary B E SymK"^" and show that 
([5]) holds when a positive semidefinite matrix A minimizes tT{Sm,kiA'^ , B'^)) among 
all A G SyniM"^" with ||A||hs = 1- Then a corresponding statement will hold for 
([9]) by symmetry. Recall that the Hilbert-Schmidt norm on SymR"^" is induced 
by the scalar product given by {A,B)^s '■= tr{AB) = j AijBij. We use the 
method of Lagrange multipliers and therefore compute the first derivatives of the 
functions f,g : SymR"^" M. given by 

f -.A^triS^^kiA^B^)) and g : A ^ tiiA^) ^ \\A\\ls- 

The derivatives Df{A)[H] and Dg{A)[H] at A € SymR"^" along the direction 
H e Sym K"^" are the coefficients of the linear terms of f{A + XH) and g{A + XH) 
considered as polynomials in A, respectively. Since 

g{A + XH) = tT{{A + XH){A + XH)) = tr{A^) + X{iY{AH) + tr{HA)) + X^ tr{H^), 

we get Dg{A)[H] = tT{AH) + tT{HA) = tT{2AH) = {2A, H), i.e., the gradient of g 
in A is Vg{A) = 2A. 

The calculation of Df(A)[H] is more complicated, but follows the same scheme, 
namely that one occurrence of A^ at a time can be replaced by AH or HA. The 
idea is the same as in the proof of [Hill Lemma 2.1]. We have 

tBy-\{AH + HA) - {AH + HA)){A'^ +tB^)"'-'^ 

HA){A^ +tB^y''^^) - 

tBy~\AH + HA){A^ + tB^)"'-'^ 

and the coefficient of t^ in the last expression is 

tiimiAH + HA)S^.iMA\B^)) ~ Df{A)[H]. 

This implies 

Df{A)[H] = {m{AS^-iMA\B^) + S„,-i.k{A\B^)A),H) 

and therefore V/(A) = miAS^n-Lki^^ B^) + S^^i,,,iA\ B^)A). 

If A is now a minimizer as stated, then we obtain a Lagrange multiplier /i G R 
such that yf{A) = ^iVg{A) (since Vg{A) ^2A^ 0), i.e., 

(10) AS^-i,u{A\B'') + S„,-i,k{A\B^)A = ^lA. 
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We now subtract the two equations that can be obtained from f[TO\i by muhiphca- 
tion with A from the left and right, respectively, and see that commutes with 
Sm-i,k{A^ , B"^). If A is in addition positive scmidefinite, then also A commutes 
with 'Sm-i,kiA'^,B^). Therefore ^ becomes ASrn-i.,k{A'^ , B'^) = f A. Moreover, 

^ = tri^A') = tr{A'S^-,.k{A\B')) = tr{SmAA',B')) 

by [Hill Lemma 2.1]. □ 

Appendix B. Self-contained proof of Conjecture 11.11 for m ~ 13 



Instead of Hillar's descent Theorem 14.11 one can use special features of the found 
nonnegativity certificates for 514^4 (X^, Y"^) and Si4fi{X'^, Y^) to deduce Conjecture 
1.11 for m equal to 13. Wc include this since the ideas might be helpful in future 
algebraic approaches to the BMV conjecture. 

Retain the notation from Section [5l From the Cholesky decomposition of Gi4_4 
we deduce that 



1=1 



for 

.91 

92 

93 

54 



= V7{Y^X^°Y^ + X^Y^X^Y'^ + X^Y^X^Y^ + X^°Y^ + X^Y^X^Y^+ 
X^Y^X^Y^), 

= V7{X'^Y^X^Y^X^ + X^Y^X^ + X^Y'^X^Y'^X'^ + X^Y^X'^+ 
= yj7X''Y^X\ 

= \^{X'^Y'^X^Y'^X^ + X^Y^X^Y^X"^ + X^Y^X'^ + X^Y"^ X^Y"^ X"^ + 

^4y4^6y2 ^ ^2y2^8y2 ^ ;^10y4 _^ X^Y"^ X'^Y'^ + X^Y'^X^Y'^). 

We now turn to Sufi^X"^, Y"^). Let [l]35x35 be the 35 x 35 matrix with all entries 
equal to 1. Then Bn^ — A[l]35x35 is positive semidefinite whenever 
5888894501020664034438572773247271387 _ g 251 
- 6345100314096416989598091089889990510969779 ^ 
As w*4_g[l]35x35Wi4^6 = -Sy^sCA^, F^)^, this implies that for some /ij e K(A), 

(11) S,,,,{X\Y^) 10-'SjAX', Y^f + 

i 

Wc arc now ready to prove Conjecture 11.11 for m = 13. It is easy to see that 
5i3,fc(A2,y2) g 02 foj. ^ g {0,1,2,11,12,13}. Let us consider ^13,3(^2,52) for 
positive semidefinite A,Bg M"^". Suppose there are such A, B with 

(12) tr(5i3,3(A2,i?2)) <0. 

By Lemma lA.li we may without loss of generality assume that A and B satisfy 
® and (HI) (with m = 13 and k = 3). Then ASi2A^^,B^) and BSi2,2iA^ , B^) 
are negative semidefinite, A commutes with 512,3(^2, i?^) and B commutes with 
5i2,2(^',S2). Hence 

513,3(^2,^2) = ^25^^_^(^2^ ^2) ^ ^2 ^^^^^ (^2 ^ ^2^ 



20 



IGOR KLEP AND MARKUS SCHWEIGHOFER 



is negative semidefinite and so is BSi3^3{A^ , B'^)B. By the above, Si4,4:{X^ ,Y^) g 
e^, so 

< tr(5i4,4(A2,i32)) = iitr(i325i3.3(^',S')) = ^triBS^.^^iA^ B')B) < 0. 

(For the first equahty see e.g. [HiTl Lemma 2.1].) As S^A^^ ^'^^) ''^ J2t=ia*9i 
with 33 = V7X^Y^X^ and triSu^A^ , B^)) = 0, A^B^A'^ = by LemmaO In 
particular, tT{{B'^A'^)*{B'^A'')) = 0, hence B'^A^ = 0. Repeating this we obtain 
= ^5/2^ = 0. But then 5'l3,3(A^S2) = 0, contradicting HH). This proves 
the BMV conjecture for (to, fc) G {(13, 3), (13, 10)}. Similarly, the cases (m, fc) = 
(13,4) and (m, fc) ~ (13, 9) can be handled. 

Let us now consider £'13,5(712, B^) for positive semidefinite A,B £ M"^". Sup- 
pose there are such A, B with 

(13) tr(£i3,5(A2,s2)) <0. 

As before, we can deduce that tr{Si4fi{A'^ , B"^)) = 0. From pT|) it follows that 
57,3(^2^^2) ^ gy g])^ ^j^jg implies 52^145 = 0, thus ^3/2^2 = A^B^/"^ = 0. 
Therefore Sis^siA'^ , B'^) = 0, contradicting This settles Conjecture fTTl for 

{m,k) G {(13,5), (13,8)}. To conclude the proof we note that the two remaining 
cases (to, k) = (13, 6) and (to, k) — (13, 7) can be handled similarly. 
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